RNA Secondary Structure Prediction with Simple Pseudoknots

نویسندگان

  • Jitender S. Deogun
  • Ruben Donts
  • Olga Komina
  • Fangrui Ma
چکیده

Pseudoknots are widely occurring structural motifs in RNA. Pseudoknots have been shown to be functionally important in different RNAs which play regulatory, catalytic, or structural roles in cells. Current biophysical methods to identify the presence of pseudoknots are extremely time consuming and expensive. Therefore, bioinformatics approaches to accurately predict such structures are highly desirable. Most methods for RNA folding with pseudoknots adopt different heuristics such as quasi-Monte Carlo search, genetic algorithms, stochastic context-free grammars, and the Hopfield networks, and techniques like dynamic programming (DP). These approaches, however, have limitations. The DP algorithm has worst case time and space complexities of O(n) and O(n), respectively. The algorithm is not practical for sequences longer than 100 nucleotides. In this paper, we present a dynamic programming algorithm for prediction of simple pseudoknots in optimal secondary structure of a single RNA sequence using standard thermodynamic parameters for RNA folding. Our approach is based on a pseudoknot technique for maximizing the number of base pairs proposed by Akutsu (Akutsu 2000). The algorithm has worst case time and space complexities of O(n) and O(n), respectively. We validate the accuracy of our algorithm by experimental results on the entire set of simple pseudoknot collection in the PseudoBase database. Our program folds 163 pseudoknots out of 169 total in the Pseudobase database predicting the structure of 131 pseudoknots correctly or almost correctly. The algorithm is quite efficient. For example, a sequence of 75 nucleotides takes 55 seconds (compared to 20 minutes with the existing software) and a sequence of 114 nucleotides takes 8 minutes (4 hours 30 min). To our knowledge, this is most accurate and efficient algorithm for predicting simple pseudoknots in optimal secondary structure of a single RNA sequence.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DP Algorithms for RNA Secondary Structure Prediction with Pseudoknots

This paper describes simple DP (dynamic programming) algorithms for RNA secondary structure prediction with pseudoknots, for which no explicit DP algorithm had been known. Results of preliminary computational experiments are described too.

متن کامل

RNA Secondary Structure Prediction Algorithms

RNA secondary structure prediction is an important problem studied extensively in the past three dacades. However, pseudoknots are usually excluded in RNA secondary structure prediction due to the hardness of examining all possible structures efficiently and model the energy correctly. Current algorithms on predicting structures with pseudoknots usually have extremely high resource requirements...

متن کامل

P-DCFold: an algorithm for RNA secondary structure prediction including all kinds of pseudoknots

Pseudoknots in RNA secondary structures play important roles, but unfortunately, their prediction is a very difficult task. The prediction of a RNA secondary structure still being not resolved, even when it does not contain pseudoknots. Many algorithms have been proposed, but almost of them are not satisfactory in results and complexities. Particularly, when pseudoknots are taken into account, ...

متن کامل

Prediction of RNA pseudoknots by Monte Carlo simulations

In this paper we consider the problem of RNA folding with pseudoknots. We use a graphical representation in which the secondary structures are described by planar diagrams. Pseudoknots are identified as non-planar diagrams. We analyze the non-planar topologies of RNA structures and propose a classification of RNA pseudoknots according to the minimal genus of the surface on which the RNA structu...

متن کامل

K-Partite RNA Secondary Structures

RNA secondary structure prediction is a fundamental problem in structural bioinformatics. The prediction problem is difficult because RNA secondary structures may contain pseudoknots formed by crossing base pairs. We introduce k-partite secondary structures as a simple classification of RNA secondary structures with pseudoknots. An RNA secondary structure is k-partite if it is the union of k ps...

متن کامل

Fast Algorithms for RNA Secondary Structure Prediction

RNA secondary structure prediction with pseudoknots is important, since pseudoknots are part of functionally improtant RNAs in cells. State of the art dynamic programming algorithms due to Akutsu et al [7] and Deogun et al [8] perform well on single RNA sequences. Our aim of this project is to be able to predict secondary structure of real life RNA sequences, which can be more than 700 nucleoti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004